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ARTICLE INFO ABSTRACT 

Keywords: Fly ash is a by-product almost found in coal power plants; it is available worldwide. According to the hazardous 
Artificial neural network impacts of cement on the environment, fly ash is known to be a suitable replacement for cement in concrete. A lot 
ae of carbon dioxide (CO?) is released during cement manufacturing. The investigations estimate that about 8 — 10% 
si: s of the total CO2 emissions are maintained by cement production. Since fly ash has nearly the same chemical 


compounds as cement, it can be utilized as a suitable alternative to cement in concrete (green concrete). The 
current study analyzes the effect of the quantity of the two main components of fly ash, CaO, and SiO», on the 
compressive strength of concrete modified with different fly ash content for various mix proportions. For this 
purpose, various concrete samples modified with fly ash were collected from the literature (236 datasets), 
analyzed, and modeled using four different models; Full-quadratic (FQ), Nonlinear regression (NLR), Multi-linear 
regression (MLR), and Artificial neural network (ANN) model to predict the compressive strength of concrete 
with different geometry and size of the specimens. The accuracy of the models was evaluated using correlation 
coefficient (R2), Mean absolute error (MAE), Root mean squared error (RMSE), Scatter Index (SI), a-20 index, and 
Objective function (OBJ). According to the modeling results, increasing SiOz (96) increased the compressive 
strength, while increasing CaO (96) increased compressive strength only when the cement replacement with fly 
ash was between 52 and -10096. Based on R?, RMSE, and MAE, the ANN model was the most effective and 
accurate on predicting the compressive strength of concrete in different strength ranges. According to the 
sensitivity analysis, curing time is the most critical characteristic for predicting the compression strength of 
concrete using this database. The primary objective of this work is to explore and evaluate various machine- 
learning models for predicting compressive resistance. The study emphasizes these models’ development, 
comparison, and performance assessment, highlighting their potential to predict compressive strength accu- 
rately. The research primarily uses machine learning, leveraging algorithms and techniques to build predictive 
models. The focus is on harnessing the power of data-driven approaches to improve the accuracy and reliability 
of compressive strength predictions. 


Fly Ash 
Soft Computing 


1. Introduction world [1]. Cement is an important factor that leads to greenhouse gases 
and global warming [2]. The key variables that lead to social and eco- 
Cement production provides 8 to 10 96 of the CO» emissions in the nomic changes are industrialization, urbanization, globalization of the 
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economy, market and consumerism, and population increase. Overall, 
climate change is a clear result of these factors resulting in dangerous 
issues for humanity [3]. The rapid development of globalization and 
population growth has caused an increase in building construction [4]. 
The construction business is growing exponentially, resulting in an 
increasing demand for construction materials, especially cement [5]. 
Nowadays, the construction industry is one of the largest environmental 
impacts among all human activities [6]. The negative environmental 
impacts associated with the widespread use of cement have encouraged 
researchers to find a satisfactory solution for reducing the demand for 
cement. As the greatest strategy to reduce CO» emissions is to replace 
cement with a suitable substitute in percentage, several studies have 
been done to find viable replacements for cement in concrete mixtures. 
Supplementary cementitious materials (SCMs) like fly ash [7], silica 
fume [8], ground granulated blast furnace slag [9], rice husk ash [10], 
and metakaolin [11] can be added to the concrete as a partial replace- 
ment. Fly ash is a widely available material worldwide [12]. It is a very 
fine powder with a fineness more than cement. Fly ash is commonly a 
by-product material found in coal industries and collected in a baghouse 
or on an electrostatic precipitator. Fly ash is one of the best alternatives 
for cement in concrete; it provides excellent mechanical properties [13]. 
Utilizing fly ash in concrete economizes the mix, protects the environ- 
ment from disposal, conserves more land, and reduces cement produc- 
tion; overall, it can improve the properties of green and hardened 
concrete [14-16]. 

Fly ash is a pozzolanic material rich in aluminous and siliceous 
substances. Generally, the chemical compositions and their ratios are 
source-dependent. According to the ASTM C619 [17], fly ash is divided 
into two classes based on the quantity of chemical compositions; class C 
and F [18,19]. Both classes of fly ash can be added to concrete in a 
specific range to enhance mechanical properties and durability. 
Numerous studies have been conducted on cement-based concrete 
modified with different types, classes, and quantities of fly ash. The 
improvement greatly depends upon the type and ratio of fly ash. The fly 
ash type correspondence to the number of chemical substances and their 
impact on the concrete properties. Among the chemical compositions of 
fly ash, the effect of various silicon dioxide (SiO2) and calcium oxide 
(CaO) ratios on the mechanical properties of concrete has been inves- 
tigated in previous studies. Class C of fly ash provides a greater 
compressive strength value for the same cement dosage and fly ash 
percentage than class F [19]. Fly ash with a higher CaO content tends to 
have a higher reactivity [20]. The fly ash compounds such as CaO at 
early ages and SiO» at later ages are associated with the strength gain of 
concrete [21]. 

Compressive strength is a significant mechanical property in con- 
crete considered in all concrete structure works. Concrete is a hetero- 
geneous material comprising different substances, such as cement, 
aggregate, water, and admixtures, resulting in different compressive 
strength values [22]. All the physical, chemical and mechanical char- 
acteristics of these ingredients and their quantity impact the compres- 
sive loading capacity of concrete [23]. The compressive strength value is 
achieved in the laboratory during the testing of the concrete samples by 
crushing the cylinders or cubes of standard dimension [24]. The 
experimental method is standardized all over the world. However, 
nowadays, laboratory tests are inefficient and uneconomical since it is 
costly and time-consuming. Recently, with the advancement of Artificial 
Intelligence (AD, Machine learning algorithm (ML), has been applied to 
predict several mechanical properties in concrete [25]. ML techniques 
such as regression, clustering, and classification can estimate various 
factors with varying efficiency and accurately forecast concrete 
compressive strength value [26]. Modeling the properties of concrete 
can be developed in various ways, including statistical techniques, 
computational modeling, and newly created tools like artificial neural 
networks (ANN), regression analysis, and the M5P-tree model [27-31]. 
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In the current study, 236 experimental values of compressive 
strength have been collected from the literature. The concrete mixtures 
included fly ash content (FA) ranging (from 71 — 316 kg/m?), CaO (0.31 
— 32%), SiOz (30.5 — 62.54%), cement content (C) (67 — 356 kg/m?), 
cement replacement (CR) (18 — 100%), coarse aggregate (CA) (801 — 
1246 kg/m?), fine aggregate (S) (522 — 905 kg/m?), water-to-binder 
ratio (w/b) (0.28 — 0.6), and curing time (t) (3 — 365 days). As well as 
the compressive strength (CS) ranged from 7.98 to 92.93 MPa. To check 
the reliability and applicability of the models, CS is divided into three 
different ranges; 5 to 34 MPa, 35 to 64 MPa, and 65 to 95 MPa. Then the 
most effective model is applied to those ranges. This study proposes 
models to predict compressive strength values using different analyzing 
approaches. The influence of CaO and SiO? contents at various mix 
proportions on the compressive strength of cement-based concrete 
modified with fly ash was studied and qualified in this study. In the 
modeling part, full quadratic (FQ), nonlinear (NLR), multi-linear (MLR), 
and artificial neural network (ANN) models are used to develop pre- 
dictive models. In addition, different statistical tools such as Correlation 
coefficient (RS), Root mean squared error (RMSE), Mean absolute error 
(MAE), Scatter index (SI), Objective function (OBJ), and an a20-index 
are used to evaluate the reliability and accuracy of the developed 
models. 


2. Research significant 
The following are the general objectives of this study: 


i. Statistically analyzing the collected data and evaluating the effect 
of CaO and SiO» on the compressive strength of fly ash-modified 
concrete, such as coarse aggregate, fine aggregate, cement, 
cement replacement, water-to-binder ratio (w/b), fly ash, cal- 
cium oxide (CaO,%), silicon dioxide (SiO2,%), and curing time. 

ii. Creating an accurate and reliable model for predicting the 
compressive strength of fly ash-based concrete. 

iii. Conducting sensitivity analysis to determine the most important 
parameter in predicting the compressive strength of concrete 
modified with various types and classes of fly ash. 


3. Methodology 


The methodology of the current study includes a series of steps which 
has been drawn as a flowchart and shown in Fig. 1. The main steps 
involve the following; 


i. Creating and collecting data on the cement-based concrete in 
different strength ranges of (5 — 34 MPa), (35 - 64 MPa), and (65 
— 95 MPa) from the literature. 

ii. Considering coarse aggregate (CA), fine aggregate (S), cement 
content (C), fly ash content (FA), cement replacement (CR), 
water-to-binder ratio (w/b), calcium oxide (CaO), silicon dioxide 
(SiOz), and curing time (t) as predictors for the models whereas, 
the compressive strength (CS) is considered as a target value of 
the models. 

iii. Randomly mixing the collected data into two groups, 7096 of the 
data as training and 30% as testing. 

iv. Developing predictive models using Full Quadratic (FQ), 
Nonlinear Regression (NLR), Multi-linear Regression (MLR), and 
Artificial Neural Networks (ANN) models. 

v. Evaluating the proposed models based on R?, RMSE, MAE, OBJ, 
SI, and the a-20 index. 

vi. Conducting sensitivity analysis to find the essential parameter in 
predicting compressive strength for the concrete modified with 
fly ash. 
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Applying 
Testing 
dataset on the 
developed 
models 


Fig. 1. Methodology of Research; ANN, artificial neural network; FQ, full quadratic; NLR, nonlinear; MLR, multi-linear; R?, correlation coefficient; RMSE, root mean 


squared error; MAE, mean absolute error; SI, scatter index; OBJ, objectives. 


3.1. Data collection 


To develop a reliable model to predict the compressive strength of 
concrete modified with fly ash, 236 data were collected from the liter- 
ature, including various fly ash ratios with class F and C, resulting in 
different ratios of main chemical compositions such as; CaO and SiO». As 
well as different coarse and fine aggregate content, cement content, and 
the curing regime was involved. Table 1 shows a summary of the 
collected data. A total of 236 data were selected in this study, and based 
on the information from the literature [32-34], the data were divided 
into two groups using RAND Function. 

Training data was included;0.70*No.ofdataset(236) = 165 Data. 

The training data was used to develop the models. 

Tested data; 0.30*No.ofdatatset(236)= 71 Data The testing dataset 
was used to validate (Check) the developed equations based on 165 
datasets. 


3.2. Pre-processing 


Pre-processing is required before using the dataset. Pre-processing 
techniques improve the performance of computational models. As a 
result, all discontinuous features (independent variables) were con- 
verted to dummy variables using Pandas, an open-source Python pack- 
age. As demonstrated in the box plot (Fig. 2), some features in the 
created dataset had values that spanned an exceptionally wide range, 
which was incompatible with model learning. Therefore, the average of 
the upper and lower bounds was employed instead. Furthermore, using 


the original non-normalized data may have influenced the ML model's 
performance [35]. This study pre-processed the database by converting 
each independent variable to a value between zero and one using the 
following equation, Eq. (1). 


_ (Xi — Ximin) (Xpmax — X;min) 
(Ximax — X;min) 


where; X; is the old value and X; is the new value. The X;min is zero and 
Xymax is one. 


3.3. Statistical analysis 


The collected data is statistically analyzed to determine the distri- 
bution of each independent variable with the dependent variable 
(compressive strength). Therefore, statistical parameters such as Mean, 
Standard Deviation (SD), Variance (Var), Kurtosis (Kur), Skewness 
(Skew), minimum (Min), and Maximum (Max) are obtained. For kur- 
tosis, the negative value indicated a short distribution tail, while the 
positive indicated a longer tail. However, skewness shows the distribu- 
tion of the variable, whether in the right or the left tail, with the right for 
positive and the left for negative values. 

The relation between compressive strength and the mentioned in- 
dependent variables is plotted in Fig. 3. The compressive strength data is 
shown in a frequency histogram in Fig. 4. The statistical analysis is 
summarized in Table 2. 
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Table 1 
Summary of collected data for cement-based concrete modified with fly ash. 
Reference No. Coarse Sand, S Cement, C Fly ash Cement Water-to- CaO (%) SiOz (96) Curing Compressive 
of aggregate, (kg/m?) (kg/m?) (kg/m?) replacement, binder time, t strength, CS 
Data CA (kg/m?) CR (96) ratio, (w/ (Days) (MPa) 
b) 
58] 53 801—1090 687-905 144-356 92-168 35-100 0.41-0.60 2.32 52.1 3, 7, 14, 11.78-45.99 
28, 56 and 
91 
15] 60 1270 522-671 160-320 80-240 20-60 0.30-0.40 1.54 62.54 7, 28, 56, 16.99-92.93 
90,180, 
256 and 
365 
59] 42 1016-1207 607-854 107-328 71-316 18-74 0.3-0.4 27.6-31.9 30.5-34.9 3, 7, 28, 15.4-72.4 
90 and 
365 
60] 13 1089-1124 726-749 179-182 219-222 55 0.3-0.4 0.3-15.8 36.9-56.8 3, 7, 28, 21.2-44.6 
56, and 91 
61] 9 843-864 842-866 161-247 155-254 40-60 0.4-0.5 13.4 52.4 7 and 28 15.6-48.3 
62] 23 1200 600 120-200 200-280 50-70 0.3-0.4 2.4-2.6 44.9-50.2 3, 7, 28, 7.98-81.54 
90, 180, 
and 365 
63] 15 1181-1246 600-819 66.9-200 167-280 50-80 0.28-0.40 2.6-26.3 35.2-50.2 7, 14, 28, 16-67.5 
90, 180 
and 365 
64] 17 1095-1195 740-797 150-274 117-204 30-52 0.4-0.5 4.2-13.4 42.7-52.4 7, 28, 56, 25.5-42.5 
and 96 
65] 4 1114-1135 727-740 157-238 159-236 40-60 0.3 2.9 52 14 and 28 38-62 
Remark 236 Ranged Ranged Ranged Varied Ranged Varied Ranged Ranged Ranged Ranged 
between 801 between between between between 18 between between between between 3 between 7.98 
and 1200 522 and 107 and 71 and and 100 0.28 and 0.3 and 30.5 and and 365 and 92.93 
905 356 316 0.6 31.9 62.54 


3.4. Specimen sizes 


The specimen parameters such as size and shape are important fac- 
tors in the compressive strength of concrete. Those factors were inves- 
tigated in the normal and high-strength concrete in the previous study 
[36,37]. In the current study, the database utilized to create the models 
comprised test samples of different sizes and shapes for assessing 
compressive strength. Standard cylinders with S1 (100 x 200) and S2 


0.8 


0.6 
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0.4 


ompressive s 


C 
= 
ba 
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Bl CaO*6 


E 510256 


B Sand (kg/m) 


(150 x 300) mm, and cubes with S3 (100 x 100 x 100) and S4 (150 x 
150 x 150) mm were included. 

The samples were all utilized to create predictive models. Using 
different models, the constructed models were evaluated using various 
specimen sizes to determine the efficiency of predicted compressive 
strength. 3996 of the data were S1, and 1896, 4196, and 296 were S2, S3, 
and S4, respectively. The total number for all specimen sizes in the 
training and testing datasets is shown in Fig. 5. 


E Cement (kg/m?) 


Bl Cement Replacement (%6) E| Water-to-binder ratio 


Curing time (days) 


Fig. 2. Box plot of the independent variables. 
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No. of Data= 236 
Mean- 1132.2 kg/m? 
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Fig. 3. Marginal plots for the compressive strength of concrete with (a) coarse aggregate (kg/m?) (b) fine aggregate (kg/m?) (c) cement (kg/m?) (d) fly ash (kg/m?) 
(e) cement replacement (96) (f) water-to-binder ratio (g) CaO (96) (h) SiOz (96), and (i) curing time (days). 
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No. of Data-236 
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Fig. 3. (continued). 
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Fig. 3. (continued). 
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No. of Data=236 
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Fig. 3. (continued). 
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Fig. 4. Histogram for compressive strength (MPa) of concrete modified with fly ash. 
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Table 2 
Summary of statistical analysis of the concrete parameters. 
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Variables Statistical parameters 
Mean Standard deviation Variance Kurtosis Skewness Min Max 
CA (kg/m?) 1132.2 120.73 14575.73 —0.265 —0.666 801 1270 
S (kg/m?) 709.1 100.57 10114.55 —1.2776 —0.0785 522 905.4 
C (kg/m?) 211.2 59.809 3577.099 —0.432 0.240 67 356 
FA (kg/m?) 173.7 58.262 3394.479 —0.827 0.278 71 316 
CR (96) 49.9 15.506 240.440 0.264 0.119 18 100 
w/b 0.39 0.083 0.007 —0.324 0.732 0.28 0.60 
CaO (96) 8.75 10.929 119.444 —0.252 1.232 0.31 31.9 
SiO; (96) 49.42 10.755 115.676 —0.890 —0.463 30.5 62.54 
t (Dyas) 86.9407 110.848 12287.359 1.071 1.520 3 365 
CS (MPa) 40.75 18.52 342.98 0.0174 0.77 7.98 92.93 
B Training © Testing 
S1= 100x200 mm 
$2-150x300 mm 
S3=100x100x100 mm 
S4= 150x150x150 mm 

Pa 

Q 

= 

© 

= 

cz 

o 

= 

As 

Specimen Sizes 
Fig. 5. A summary of the sample sizes utilized in training and testing for developing models. 
3.5. Modeling 


The correlation between dependent and independent variables es- 
tablishes a direct relationship between the concrete mixture proportion 
and compressive strength. The datasets were randomly divided into 
training and testing datasets [38,39]. The training dataset is used to 
develop the models, whereas the testing dataset checks those developed 
models against unobserved data during training. Based on the correla- 
tion matrix, a poor correlation between independent variables such as 
CA, S, C, FA, CR, w/b, CaO, and SiO; and the dependent variable which 
is CS is observed as shown in Fig. 6. The correlations are 0.47, —0.30, 
0.40, —0.23, —0.53, —0.58, —0.08, and —0.34, respectively. However, 
an acceptable correlation coefficient between CS and t is noticed by 
0.75. Therefore, different multivariable models are advanced in the 
following section to establish an analytical model to predict the 
compressive strength of concrete. 


3.5.1. Full quadratic (FQ) model 

Equation (2) shows the full quadratic formula, representing a rela- 
tionship between compressive strength, the first and second degrees of 
each independent variable, and the interaction between independent 
variables [32,40-44]. This model is found to have a complex combina- 
tion of mathematical terms [45,46]. 


CS = P, + B (CA) + PCS) + By(C) + Bs(FA) + Bo(CR) +B, (7) 

+ Ps(Ca0) + By(SiO2) + Proli) + B (CAYGS) + Bi2(CA)(C) 

+ Pis(CA)(FA) + f (CA) (CR) + Bis (CA) (7) + Big(CA)(Ca0) 

+ By(CA)(SiO2) + Byg(CA)(0) + iy (5) (C) + Bas (S) (FA) 

+ Bai (S) (CR) + ox (S) (Z) + Bas() (CaO) + fs (S) (S05) + fos (SY) 
+ bas (C (FA) + Pa (C (CR) + BoC) (Z) + Ba9(C) (CaO) 


+ Bp(C)(SiOs) + Ps (C) (1) + Bas (FA) (CR) + Ps (FA) (7- 


+ B4,(FA)(CaO) + B; (FA) (SiOz) + B,(FA)(t) + B37(CR) *) 
+ Bss (CR) (Ca0) + Bsy(CR) (S102) + Ja (CR) (1) + Ba (Z) (CaO) 


Be) (S102) + Bas (7) (0 + Bua(CaO)(SiO2) + fas (CaO)() 


2 
+ fs; (2) + Bs3 (CaO)? fs, (SiO;)* + Pss (t) 
(2) 


CS, CA, S, C, FA, CR, w/b, CaO, SiO», and t are compressive strength, 
coarse aggregate content, fine aggregate content, cement content, fly 
ash content, cement replacement, water-to-binder ratio, calcium oxide 
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gu. 


w/b CaO SiO, t CS 


Fig. 6. Correlation matrix plot between the dependent and independent variables of concrete modified with fly ash. 


ratio, silicon dioxide rato, and curing time, respectively. Furthermore, 
the ff, to P55 are model parameters. 


3.5.2. Nonlinear regression (NLR) model 

The following formula is a typical form to develop a nonlinear 
regression model [32,47,48] for predicting the compressive strength of 
concrete containing different types of fly ash with various quantities of 
CaO and SiOz, Eq. (3). NLR model has a disadvantage which is the 
complex mathematical operation [49]. 


w 


CS = a, (CA)® + a3(S)™ +as(C)“+ar( FA)“ +a (CR) +a G 


CS, CA, S, C, FA, CR, w/b, CaO, SiO», and t are compressive strength, 
coarse aggregate content, fine aggregate content, cement content, fly 
ash content, cement replacement, water-to-binder ratio, calcium oxide 
ratio, silicon dioxide ratio, and curing time, respectively. The o to og is 
defined as the model parameters. 


3.5.3. Multi-linear regression (MLR) model 

The compressive strength of the concrete containing different CaO 
and SiO» ratios is predicted using the multi-linear regression model. 
Equation (4) represents the product of the variables affecting the 
compressive strength of the concrete in exponential and constant terms. 
The MLR model has some advantages, such as simple mathematical 
operation and easy implementation resulting in more accurate results 
compared to the LR model. However, the MLR model greatly depends on 
the number of data; fewer number provides less accuracy. Therefore, 
this model has poor quality [32]. 


C$ = a (CA (S^ (C (FA (CR) (7) (cao)^(sio"Qy" A 
Where a;toa1o are defined as the model parameters 
B, = a (CA) + ba (S) + 6y(C) + d (FA) + e (CR) + fa (7) + 8. (C20) 
b (5a) 
+h, (SiOz) + in(t) +b 
Nodel Node2 Node, 
Gis En EE Threshold (5b) 


J^ + ais (Ca0)™ +a1s(Si02) + a (H 
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3.5.4. Artificial neural network (ANN model) 

The Artificial Neural Network (ANN) is a computational system that 
analyzes data like the human brain. This model is also a machine 
learning system used in construction engineering for various numerical 
forecasts and challenges [27,29,31,42]. ANN is made up of three layers; 
input, hidden, and output, which are linked by biases and weights 
[44,50], Eq. (5). This study designed a multi-layer feed-forward network 
with concrete components (CA, S, C, FA, CR, w/b, CaO, SiO», and t) as 
input and CS as output. In the output layer, a sigmoid activation function 
is utilized. 


(3) 


(5) 


Output = f ( » WiXj + 2 


j is the number of input variables, x; is the input parameter, and bias is 
the threshold for the sigmoid activation function. 

Fig. 7 shows the typical process of the ANN result. As well as the 
typical procedure for ANN calculation in a single hidden layer is illus- 
trated in Eq. (5). 

The Nodel to Node,, and Threshold are directly calculated. 


3.6. Metrics to evaluate developed models 


The developed models are evaluated using different parameters such 
as correlation coefficient (RI, root mean squared error (RMSE), mean 
absolute error (MAE), scatter index (SI), objectives (OBJ), and a20- 
index. The values of the a20-index are predicted to be unity for a per- 
fect predictive model. The proposed a20-index has the advantage of 
having physical engineering meaning; it declares the number of samples 
that satisfy expected values with a 2096 variance from experimental 
values [51]. The following equations (6) — (11) are utilized to determine 
each mentioned criterion [46,52—54]. 
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Fig. 7. Typical ANN output technique in a single node. 


R= Daily = x (yi — y) - (6) 
"S -7P x ren 
(7) 
MAE = Zei — spl) (8) 
n 
RMSE 
gu (9) 
yi 
Ny, RMSE, + MAE, Ds RMSE; + MAE 
BJ = 1 
id Es Ri +1 ) " (s Rọ +1 ) me 
a20 — index — ve (11) 


where; 

yp = predicted value of compressive strength. 

yi = experimental value of compressive strength. 

x = mean of predicted compressive strength values. 

y = mean of experimental compressive strength data. 

Nyy = number of the training dataset. 

ni, = number of the testing dataset. 

Ngy = a total number of datasets, including training and testing. 

N — total number of data in the dataset. 

N20 — total number of predicted to the experimental data of 
compressive strength ratio ranged from 0.8 to 1.2. 

The value of R? and a-20 index is commonly between (0to1), and the 
best value is considered 1. The RMSE, MAE, and OBJ values are between 
(Otooo), the value is recommended to be low as possible, and zero is the 
best value. As well as, if the value of SI was smaller than 0.1, the model is 
qualified as an excellent performance. Whereas the SI value is between 
(0.1 to 0.2), (0.2 to 0.3), and greater than 0.3, indicating the good, fair, 
and poor performance of the model, respectively [44,55]. 
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4. Results and discussion 
4.1. Variation of predicted and measured compressive strength 


4.1.1. FQ model 

The full quadratic model is one of the most effective models due to its 
advanced mathematical expression. This model was used to predict the 
compressive strength of concrete modified with various fly ash types. It 
was derived based on mathematical parameters such as constant, linear, 
variable product terms/interaction, and squared variables. Equation 
(12) shows the formula for the FQ model for predicting the compressive 
strength of concrete. The correlation between predicted and measured 
compressive strength for the FQ model is illustrated in Fig. 8. The value 
of R? and RMSE was 0.94 and 4.627 for training and 0.887 and 4.57 for 
testing data, respectively. The training dataset contains an error line of 
15 to —1596, meaning that 85% of the data falls between 0.85 and 1.15 
for the predicted to measured compressive strength ratio. 


CS = —126 —0.17(CA) — 0.03(S) + 0.9(C) — 0.35(FA) — 1.36(CR) 
$ 1.2(F) — 2.8(CaO) +5.46(SiO») +0.4(1) + 0.0002(CA)(S) 
— 0.0002(CA)(C) — 0.0002(CA)(FA) + 0.001(CA)(CR) 
—0.05(CA) (2) + 0.0001(CA)(CaO) — 0.003 (CA) (SiO2) 
— 0.0003(CA)(t) — 0.0004 (S) (C) — 0.001 (S) (FA) + 0.00003(S)(CR) 
— Q.19(S) (2) +0.003(S)(CaO) + 0.0012(S)(SiO;) — 0.0004(S)(r) 
+0.002(C)(FA) — 0.0003(C)(CR) — 0.18(C) (2) — 0.0007 (C)(CaO) 
— 0.0097 (C)(SiO;) — 0.0002 (C)(r) + 0.004(FA)(CR) +0.48(FA) (2) 
CaO) — 0.005(FA)(SiO;) — 0.001 (FA) (1) 
—0.16(CR) (s +0.04(CR)(CaO) + 0.002 (CR)(SiO;) 

) 


) 
) 


ae 


— 0.004(FA 


+0.003(CR)(t) + 1.95 (2) (CaO) -- 0.04 (2) (SiO) — 0.26 (2) (f) 
— 0.03 (Ca0) (SiO2) + 0.001 (CaO)(1) + 0.6(SiO;) (f) 
+ 0.0001 (CA)? — 0.000002(S)? — 0.0005(C)^-- 0.001(FA)? 


2 
— 0.0003(CR)? + 0.0001 (7). 0.0001 (CaO)* 


+ 0.0004 (SiO;)? — 0.0003 (1)? 
(12) 
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Fig. 8. Comparison between measured and predicted CS using FQ model for training and testing dataset. 


No. of training dataset — 165, R? = 0.94, RMSE = 4.627 MPa. 


4.1.2. NLR model 

The nonlinear regression model was also used to predict the 
compressive strength of fly ash-modified concrete. The NLR model result 
is shown in Eq. (13). The relationship between predicted and measured 
compressive strength is displayed in Fig. 9. The training dataset has an 
R? and RMSE value of 0.816 and 8.36 MPa. Also, an R? of 0.47 and RMSE 
of 9.62 MPa for the testing dataset were observed. According to the 
RMSE results, the NLR model performance is fair. The FQ model results 


y 


—0.05 
CS— 13 (cays (5)°35 (0 (FA) (coy ^ (2) (cao)? (sio, 4 (gj onno 


provide better performance than the NLR model. The error line in the 
training dataset is 30 to —30%, which means that 70% of the data falls 
between 0.7 and 1.3 for the predicted to measured compressive strength 
ratio. 


CS = 140.66 (CA) °°? +.0.68(8) ^" +1.06(C)P ? — 0.041 (FA)? — 3.096(CR) **" + 13.94( 


No. of training dataset = 165, R? = 0.816, RMSE = 8.36 MPa. 


4.1.3. MLR model 
The multi-linear regression model is another model utilized to pre- 
dict the compressive strength of concrete modified with fly ash. It is 
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counted as one of the least effective models due to its elementary 
mathematical expression. The MLR model formula consists of constant 
terms with terms that rose to the power of constant variables. The 
following equation presents the variables and their relationship in 
detail; Eq. (14). The relation between predicted and measured 
compressive strength is illustrated in Fig. 10. The R? and RMSE were 
0.79 and 9.22 for training, 0.61 and 8.25 for testing data, respectively. 
The error line in the training dataset is 25 to —2596, which means that 
7596 of the data falls between 0.75 and 1.25 for the projected to 
measured compressive strength ratio. 


(14) 


No. of training dataset = 165, R? — 0.79, RMSE — 9.22 MPa. 


Ww 


-1.14 
5) — 26.95 (CaO) °° — 2»7 (SiO) °° +5.32 (N° 


(13) 


4.1.4. ANN model 

The Optimum ANN network structure is the last model used to pre- 
dict the compressive strength of concrete. Five trials were selected 
depending on the RMSE and MAE values, as shown in Fig. 11. Among 
those five trials, the lowest RMSE and MAE values were observed in the 
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Fig. 9. Comparison between measured and predicted CS using NLR model for training and testing dataset. 


network containing one hidden layer with 19 neurons. The ANN 
network having one hidden layer with 19 neurons with 0.1 of mo- 
mentum, a learning rate of 0.2, and 2000 learning time, is chosen, 
Fig. 12. The ANN formula, including weights and biases, is shown in Eq. 
(15). Fig. 13 shows the relation between predicted compressive strength 
by the ANN having 19 neurons with the experimental compressive 
strength values for both training and testing data. ANN network analysis 
has an R? of 0.987 and RMSE of 3.3 MPa when the training data are 
applied. The model provides an R? of 0.986 and RMSE of 3.52 MPa when 
testing data is used. The training dataset has an error line of 10 to — 1096, 
indicating that 9096 of the data are laid between 0.9 and 1.1 for the ratio 
of predicted compressive strength to measured compressive strength. As 
well as Fig. 13 shows the variation between measured and predicted 
compressive strength values plotted for different strength ranges based 
on the training dataset. The highest value of R? was found in the CS of 65 
to 95 MPa. 


0.1 0.02 
loe lteh 


No. of training dataset — 165, R? — 0.987, RMSE — 3.3 MPa. 


0.15 
l4 e^ 


0.95 


CS "om 


7- 0.475 


(15) 


4.2. Effect of CaO and SiOz content on the compressive strength of 
concrete 


The FQ model was used to examine the influence of cement 
replacement percentage, CaO (96), and SiOz (96) on cement-based con- 
crete's compressive strength using a fly-ash modified concrete dataset. 
All independent parameters were held constant, and the CR was 
increased from 18 to 10096; the compressive strength grew as the cement 
replacement increased. The SiOz (96) was then increased from 30.5 to 
62.5496 while the CA, S, C, FA, CR, w/b, CaO, and t remained constant. 
According to the model results, increasing SiO? causes a gradual in- 
crease in compressive strength, while CaO (96) ranges between 0.3 and 
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Fig. 10. Comparison between measured and predicted CS using MLR model for training and testing dataset. 
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Fig. 11. Selecting optimum ANN based on RMSE and MAE. 


D. Kakasor Ismael Jaf et al. Construction and Building Materials 400 (2023) 132604 


Hidden layer 
(19 hidden neurons) 


Output layer 


Fig. 12. Optimal ANN network structures containing one hidden layer and 19 hidden neurons. 
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Fig. 13. Comparison between measured and predicted CS using ANN model for training and testing dataset. 
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Fig. 14. The effect of CaO and SiO» content of fly ash on the compressive 
strength of concrete. 


24% but contributes a reduction above that. To determine the influence 
of CaO (96), all other parameters were fixed, including CA, S, C, FA, CR, 
w/b, and SiO», and the CaO (96) varied from 0.3 to 31.996. Concrete 
compressive strength improved as the CaO (96) increased, but only when 
the cement replacement was between 52 and 100% and dropped below 
that. Fig. 14 depicts the performance of CaO and SiO». 


5. Evaluation of developed models 


The study attempted to determine the effect of two major chemical 
compositions of fly ash, CaO, and SiO», on the compressive strength of 
cement-based concrete modified with various types and amounts of fly 
ash. The experiment includes forecasting concrete compressive strength 
using four alternative models; FQ, NLR, MLR, and ANN. Each model 
supplied a formula based on several mathematical parameters. To 
evaluate the performance of each constructed model, various assessment 
criteria were applied. 

According to the R?, RMSE, and MAE statistics, the ANN model has the 
maximum accuracy and reliability for prediction using the training and 
testing dataset. Table 3 summarizes the statistical criteria outcomes for 
the created models. The ANN model has an R? of 0.987, RMSE of 3.3 MPa, 
and MAE of 2.662 MPa based on the training dataset, as well as R? of 
0.986, RMSE of 3.52 MPa, and MAE of 2.96 MPa for the testing dataset. 

Furthermore, more data are along the Y — X line for the ANN model, 
which has an error line of 10 to —109^ for the training dataset, referring 
to the fact that 90% of the data are between 0.90 and 1.1 (predicted CS/ 
measured CS). The second-ranked model is the FQ model; it has an R? of 
0.94, RMSE of 4.627 MPa, and MAE of 3.496 MPa for the training 
dataset, and R? of 0.887, RMSE of 4.57 MPa, and MAE of 3.457 for the 
testing dataset. The FQ model has an error line of + 15 for the training 
dataset. The NLR and MLR model have error lines of + 30, and + 25, 
respectively, indicating the model's poor performance. Fig. 15 compares 
created models based on the testing dataset; model values are within the 
20 and + 30% error lines. 

The training dataset for the models based on three different CS 
ranges was plotted, and the RMSE values were provided. Table 4 shows 
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Fig. 15. Variation of measured and predicted CS for developed models using 
the testing dataset. 


Table 4 
Root mean squared error (RMSE) for the developed models in different ranges of 
compressive strength. 


Compressive No. Models Best Model 
Strength range of FQ NIR MIR ANN Performance 
(MPa) data 

5 to 34 63 3.731 7.723 6.335 3.576 ANN 

35 to 64 78 4.312 8.231 10.599 3.16 ANN 

65 to 95 24 7.087 10.21 10.686 2.616 ANN 


an overview of the findings. There were 63 datasets for the strength 
range of 5 to 34 MPa, 78 in the range of 65 to 95 MPa, and 24 data for the 
65 to 95 MPa. According to the findings, the ANN model was more 
appropriate than other models for all CS ranges. The model provided an 
RMSE of 3.576, 3.16, and 2.616 MPa for 5 to 34, 35 to 65, and 65 to 95 
MPa, respectively. 

The same models were employed to identify the ideal specimen 
shape and size. The dataset for each size specimen was modeled based 
on training and testing datasets. Table 5 shows RMSE values for different 
models. According to the table, based on a satisfied number of data, the 
cylinder specimens with dimensions of 100*200 mm have the lowest 
RMSE value, 0.124 MPa, for the training dataset using the ANN model. 
Although, the cylinder specimens with 150*300 mm dimensions have 
the lowest value of RMSE for the testing dataset, which is 0.01 MPa. 

Furthermore, the created models using the training and testing 
dataset were also evaluated by comparing the OBJ function and SI 
values. The highest value of SI is 0.104 for training, and the ANN model, 
Fig. 16, concluded 0.074 for testing. The model also has the lowest OBJ 
function compared to the other models, indicating that the model is 
more accurate in forecasting the compressive strength of concrete 
modified with fly ash. The model has an OBJ value of 3.09 and 1.0 MPa 
based on the training and testing dataset, respectively, Fig. 17. 
Although, the highest a-20 index value was observed for the MLR model; 
104.4% for the training and 104.8% for the testing dataset, Fig. 18. 


Table 3 
Summary of performance evaluation parameters for the developed models. 
Models Fig. (No) Eq. (No.) Training Testing Ranking 
R? RMSE (MPa) MAE (MPa) R? RMSE (MPa) MAE (MPa) 
FQ 8 12 0.94 4.63 3.496 0.887 4.57 3.457 2 
NLR 9 13 0.816 8.36 6.698 0.47 9.62 7.522 3 
MLR 10 14 0.79 9.22 7.199 0.62 8.25 6.665 4 
ANN 13 15 0.987 3.3 2.662 0.986 3.52 2.96 1 
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Table 5 


Root mean squared error (RMSE) of the developed models using different specimen sizes and conducting. 
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Fig. 16. Comparison of developed models using (a) SI and (b) MAE. 
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Fig. 17. Comparison of developed models using the OBJ function. 


In addition, the created models were compared using residual error, 
as shown in Fig. 19. The residual error value is obtained by subtracting 
the expected compressive strength from the measured compressive 
strength. The results show that the FQ model has the lowest error value, 
ranging from —16.58 to + 17.14 MPa. At the same time, the residual 
error for the NLR, MLR, and ANN models is —17.81 to + 24.64 MPa, 
— 24.61 to + 24.85 MPa, and —6.80 to + 8.28 MPa, respectively. 


6. Sensitivity investigation 


A sensitivity analysis was performed to evaluate the influence of each 
independent parameter on the value of modeled dependent variable 
[32,44,56,57]. To determine the sensitivity index of each parameter on 
the developed model, one parameter was extracted each time for the 
training data. The ANN was used for the sensitivity investigation as it 
was observed as the best model for predicting the compressive strength 
of concrete compared to the other conducted models. All R, RMSE, and 
MAE values were reported for each trial. The sensitivity results are 
shown in Table 6. The findings indicate that the curing time is the most 
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significant and influential parameter for predicting the compressive 
strength of concrete, Fig. 20. 


7. Conclusions 


To find an accurate and reliable model to predict the compressive 
strength of cement concrete modified with different fly ash types and 
quantities, 236 data samples for fly ash-modified cement concrete with 
different mixture proportions, CaO and SiO» quantity of fly ash, aggre- 
gate content, cement content, cement replacement, water-to-binder 
ratio, and curing time were collected from previous researches. Ac- 
cording to the collected data and the result of four different model ap- 
proaches, the following can be drawn: 


The cement replacement percentage ranged between 18 and 100%. 
The compressive strength of concrete gradually increased when the 
replacement (%) was increased. 

The data collected from various research studies show that CaO (%) 
varied between 0.3 and 31.9%. SiO; (96) ranged between 30.5 and 
62.54%. 

Increasing CaO (%) in fly ash caused an increase in compressive 
strength only when the cement replacement ranged between 52 and 
100%. The CS also improved with increasing SiO2 (96), where; the 
CaO (96) varied between 0.3 and 2496. 

Based on multiple assessment criteria such as R?, RMSE, and MAE, 
the ANN model showed the best accuracy and performance for pre- 
dicting the compressive strength of concrete. The model had the 
highest value of R> which was 0.987 and 0.986 for training and 
testing datasets, respectively. The lowest value of RMSE and MAE 
was found as 3.3 and 2.66 for training, and 3.53 and 2.96 for testing, 
respectively. 

Related to other statistical tools, such as SI and OBJ, the ANN model 
was first ranked. The SI and OBJ function values were 0.104 and 
3.09 MPa for training and 0.074 and 1.0 MPa for the testing dataset, 
respectively. Although the highest a-20 index was observed for the 
MLR model, the value was 104.496 for the training dataset. 
According to the shape and size of specimens, the cylinder specimens 
having 100*200 mm were obtained for the best performance using 
the ANN model. 
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Fig. 18. Comparison of developed models using a20-index. 
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——ANN — FQ Training Data = 165 
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Fig. 19. Variation between measured and predicted CS for developed models based on residual error (a) ANN and FQ, (b) ANN and NIR, and (c) ANN and MLR. 
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Training Data = 165 
——ANN ——MLR Testing Data = 71 
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Fig. 19. (continued). 


Table 6 

Sensitivity analysis using the ANN model applied to the training dataset. 
No. Combination Removed Parameter R? RMSE (MPa) MAE (MPa) Ranking based on RMSE and MAE 
1 CA, S, C, FA, CR, w/b, CaO, SiOz, t - 0.987 3.30 2.662 - 
2 S, C, FA, CR, w/b, CaO, SiOz, t CA 0.987 3.984 3.254 3 
3 CA, C, FA, CR, w/b, CaO, SiO», t S 0.989 3.045 2.497 7 
4 CA, S, FA, CR, w/b, CaO, SiO», t C 0.990 3.800 3.165 4 
5 CA, S, C, CR, w/b, CaO, SiOz, t FA 0.989 3.127 2.514 6 
6 CA, S, C, FA, w/b, CaO, SiO», t CR 0.990 3.800 3.165 4 
T CA, S, C, FA, CR, CaO, SiO», t w/b 0.976 4.424 3.446 2 
8 CA, S, C, FA, CR, w/b, SiO}, t CaO 0.990 2.800 2.217 8 
9 CA, S, C, FA, CR, w/b, CaO, t SiO2 0.989 3.523 2.824 5 
10 CA, S, C, FA, CR, w/b, CaO, SiO2 t 0.876 10.450 8.109 1 
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Fig. 20. The percentage contribution of input variables in predicting compressive strength of concrete using the ANN model. 
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The ANN model was applied to different CS ranges of concrete. The 
best performance of the model was observed in the range of 35 to 64 
MPa. 

Sensitivity analysis illustrates the most effective parameter for pre- 
dicting the compressive strength of concrete: the curing time. 
Using multiple models provides construction professionals with 
broader information and insights. This enhances decision-making 
processes by considering different perspectives and approaches, 
leading to more informed and effective decisions throughout the 
project lifecycle. 

Employing multiple models allows for the validation and cross- 
verification of results. By comparing the predictions and outcomes 
of different models, construction practitioners can assess the accu- 
racy and reliability of the models in different scenarios, increasing 
confidence in the results. 

The construction industry is diverse, and no single model can address 
all situations. Applying multiple models acknowledges this diversity 
and provides the flexibility to choose the most appropriate model for 
specific projects, materials, or contexts. It enables the construction 
industry to adapt to evolving needs and challenges. 

The use of multiple models encourages continuous improvement in 
construction practices. By comparing and evaluating different 
models, areas for improvement can be identified, leading to ad- 
vancements in modeling techniques, data collection, and analysis 
methodologies. This fosters innovation and progress within the 
industry. 

Discuss the advantages of machine learning models in the context of 
compressive strength prediction. These may include their ability to 
capture complex relationships, handle large datasets, adapt to 
different scenarios, and potentially provide better accuracy than 
traditional analytical models. 
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